Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data
Abstract
Classification trees are a widely used machine learning method. They have multiple implementations as R packages, including rpart, ctree, evtree, and C5.0. The details of these implementations are not the same, hence their performance differs from one application to another. We are interested in their performance in classifying cells using single-cell RNA-sequencing data. In this paper, we conducted a benchmark study on 22 single-cell RNA-sequencing data sets. Using cross-validation, we compare the packages' prediction performance based on Precision, Recall, F1-score, and Area Under the Curve (AUC). We also compared the Complexity and Run-time of the packages. Our study shows that rpart and evtree have the best Precision; [...] F1-score and AUC; C5.0 prefers more complex trees; [...] consistently much faster than the others, although its complexity is often higher than the others.
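To make the evaluation protocol in the abstract concrete, the following is a minimal sketch of k-fold cross-validation followed by per-class Precision, Recall, and F1-score. It is not the authors' code: the study uses R packages, whereas this sketch uses plain Python with a one-split tree (a decision stump) as a stand-in classifier. The function names (`fit_stump`, `cross_validate`, `precision_recall_f1`), the synthetic two-gene data, and the labels `type_A` and `type_B` are all assumptions made for illustration. AUC, Complexity, and Run-time are omitted.

```python
import random
from collections import Counter


def fit_stump(X, y):
    """Fit a one-split classification tree (stand-in for rpart, C5.0, etc.)."""
    majority = Counter(y).most_common(1)[0][0]
    best = None  # (errors, feature index, threshold, left label, right label)
    for j in range(len(X[0])):
        values = sorted({row[j] for row in X})
        # Candidate thresholds are midpoints between consecutive observed values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [lab for row, lab in zip(X, y) if row[j] <= t]
            right = [lab for row, lab in zip(X, y) if row[j] > t]
            l_lab = Counter(left).most_common(1)[0][0]
            r_lab = Counter(right).most_common(1)[0][0]
            errors = sum(lab != l_lab for lab in left)
            errors += sum(lab != r_lab for lab in right)
            if best is None or errors < best[0]:
                best = (errors, j, t, l_lab, r_lab)
    if best is None:  # no usable split: always predict the majority class
        return lambda row: majority
    _, j, t, l_lab, r_lab = best
    return lambda row: l_lab if row[j] <= t else r_lab


def cross_validate(X, y, k=5, seed=0):
    """Return true and predicted labels pooled over k held-out folds."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    y_true, y_pred = [], []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        model = fit_stump([X[i] for i in train], [y[i] for i in train])
        y_true += [y[i] for i in fold]
        y_pred += [model(X[i]) for i in fold]
    return y_true, y_pred


def precision_recall_f1(y_true, y_pred, positive):
    """Precision, Recall and F1-score, treating `positive` as the target class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Synthetic "expression" values for two hypothetical genes and cell types.
    rng = random.Random(1)
    X = [[rng.gauss(5, 1), rng.gauss(0, 1)] for _ in range(30)]
    X += [[rng.gauss(1, 1), rng.gauss(0, 1)] for _ in range(30)]
    y = ["type_A"] * 30 + ["type_B"] * 30
    y_true, y_pred = cross_validate(X, y, k=5)
    for cell_type in ("type_A", "type_B"):
        print(cell_type, precision_recall_f1(y_true, y_pred, cell_type))
```

Pooling the held-out predictions before computing the metrics is one common convention; averaging per-fold metrics is another, and the abstract does not say which one the study used.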
Similar resources
A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data
Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...
I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing
Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNA-sequencing (RNA-seq) of both human and mouse preimplantation embryos, ...
LEAP: constructing gene co-expression networks for single-cell RNA-sequencing data using pseudotime ordering
Summary: To construct gene co-expression networks based on single-cell RNA-Sequencing data, we present an algorithm called LEAP, which utilizes the estimated pseudotime of the cells to find gene co-expression that involves time delay. Availability and Implementation: R package LEAP available on CRAN. Supplementary information: Supplementary data are available at Bioinf...
A sparse differential clustering algorithm for tracing cell type changes via single-cell RNA-sequencing data
Cell types in cell populations change as the condition changes: some cell types die out, new cell types may emerge and surviving cell types evolve to adapt to the new condition. Using single-cell RNA-sequencing data that measure the gene expression of cells before and after the condition change, we propose an algorithm, SparseDC, which identifies cell types, traces their changes across conditio...
Journal
Journal title: Microbiology Research
Year: 2021
ISSN: 2036-7473, 2036-7481
DOI: https://doi.org/10.3390/microbiolres12020022